An efficient algorithm for Web usage mining

نویسندگان

  • Florent Masseglia
  • Pascal Poncelet
  • Rosine Cicchetti
چکیده

With the growing popularity of the World Wide Web (Web), large volumes of data are gathered automatically by Web servers and collected in access log files. Analysis of server access data can provide significant and useful information. In this paper, we address the problem of Web usage mining, i.e. mining user patterns from one or more Web servers for finding relationships between data stored [COO 97], and pay particular attention to the handling of time constraints [SRI 96]. We adapt a very efficient algorithm for mining sequential patterns in the “market-basket” approach [MAS 98], to this particular context. RÉSUMÉ. Avec la popularité du World Wide Web (Web), de grandes quantités d’information sont automatiquement collectées par des serveurs Web et stockées dans des fichiers access log. L’analyse de ces fichiers peut fournir des informations pertinentes et utiles [COO 97]. Dans ce papier nous abordons le problème de l’analyse du comportement des utilisateurs avec une attention particulière à la prise en compte de contraintes de temps [SRI 96]. Nous adaptons un algorithme efficace de recherche de motifs séquentiels [MAS 98] pour découvrir des corrélations dans les données issues de serveurs Web.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Mining of Cross-Transaction Web Usage Patterns in Large Database

Web Usage Mining is the application of data mining techniques to large Web log databases in order to extract usage patterns. A cross-transaction association rule describes the association relationships among different user transactions in Web logs. In this paper, a Linear time intra-transaction frequent itemsets mining algorithm and the closure property of frequent itemsets are used to mining c...

متن کامل

Web Service Usage Mining: Mining For Executable Sequences

As service world becomes bigger, behavior of people in this world becomes interesting and analysis of usage sequences can yield useful information about web services and the way they are used. Application of data mining and web mining techniques in the field of web services is introduced in order to discover interesting patterns among web services usage and interactions. In this paper, we intro...

متن کامل

Efficient Proxy Server Caching Using Web Usage Mining Technique on Web Logs - for Improving Hit Rate and Response Time

This paper presents a vertical application of web usage mining: efficient web caching for improving the response time , for the internet users ,specially due to increase in number of users of e-commerce on the internet Introducing efficient web caching algorithms that employ predictive models of web requests; the general idea is to extend the cache replacement policies of proxy servers by makin...

متن کامل

Minimizing the Repeated Database Scan Using an Efficient Frequent Pattern Mining Algorithm in Web Usage Mining

Data Mining, is the process of discovery of new patterns and knowledge from large dataset. Web mining is the application of data mining techniques to extract and mine useful knowledge and interesting patterns from World Wide Web .Web data including web documents, hyperlinks between documents, usage logs of web sites. The web usage data captures the identity and origin of the web user along thei...

متن کامل

Mining Constraint-based Multidimensional Frequent Sequential Pattern in Web Logs

In this paper we introduce an efficient strategy for discovering Web usage mining is the application of data mining techniques to discover usage patterns from Web data, in order to understand and better serve the needs of Web-based applications. Web usage mining consists of three phases, namely preprocessing, pattern discovery, and pattern analysis. This paper describes each of these phases in ...

متن کامل

An Efficient Algorithm for Data Cleaning of Log File using File Extensions

World Wide Web is a monolithic repository of web pages that provides the Internet users with heaps of information. With the growth in number and complexity of Websites, the size of web has become massively large. Web Usage Mining is a division of web mining that involves application of mining techniques to web server logs in order to extract the behavior of users. A Web Usage Mining process com...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999